Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
Identifieur interne : 003977 ( Main/Exploration ); précédent : 003976; suivant : 003978Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
Auteurs : Georges Quénot [France] ; Tien Ping Tan [France] ; Viet Bac Le [France] ; Stéphane Ayache [France] ; Laurent Besacier [France] ; Philippe Mulhem [France]Source :
- Multimedia Tools and Applications [ 1380-7501 ] ; 2010-05-01.
English descriptors
- KwdEn :
Abstract
Abstract: We present in this paper an approach based on the use of the International Phonetic Alphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents. The approach works even if the languages of the document are unknown. It has been validated in the context of the “Star Challenge” search engine competition organized by the Agency for Science, Technology and Research (A*STAR) of Singapore. Our approach includes the building of an IPA-based multilingual acoustic model and a dynamic programming based method for searching document segments by “IPA string spotting”. Dynamic programming allows for retrieving the query string in the document string even with a significant transcription error rate at the phone level. The methods that we developed ranked us as first and third on the monolingual (English) search task, as fifth on the multilingual search task and as first on the multimodal (audio and image) search task.
Url:
DOI: 10.1007/s11042-009-0377-6
Affiliations:
- France
- Auvergne-Rhône-Alpes, Provence-Alpes-Côte d'Azur, Rhône-Alpes, Île-de-France
- Grenoble, Marseille, Orsay
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000325
- to stream Istex, to step Curation: 000323
- to stream Istex, to step Checkpoint: 000A84
- to stream Main, to step Merge: 003A55
- to stream Main, to step Curation: 003977
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet</title>
<author><name sortKey="Quenot, Georges" sort="Quenot, Georges" uniqKey="Quenot G" first="Georges" last="Quénot">Georges Quénot</name>
</author>
<author><name sortKey="Tan, Tien Ping" sort="Tan, Tien Ping" uniqKey="Tan T" first="Tien Ping" last="Tan">Tien Ping Tan</name>
</author>
<author><name sortKey="Le, Viet Bac" sort="Le, Viet Bac" uniqKey="Le V" first="Viet Bac" last="Le">Viet Bac Le</name>
</author>
<author><name sortKey="Ayache, Stephane" sort="Ayache, Stephane" uniqKey="Ayache S" first="Stéphane" last="Ayache">Stéphane Ayache</name>
</author>
<author><name sortKey="Besacier, Laurent" sort="Besacier, Laurent" uniqKey="Besacier L" first="Laurent" last="Besacier">Laurent Besacier</name>
</author>
<author><name sortKey="Mulhem, Philippe" sort="Mulhem, Philippe" uniqKey="Mulhem P" first="Philippe" last="Mulhem">Philippe Mulhem</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:0EA989FF22B3EF27C120F806C0ADBC5AB14EB9A6</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/s11042-009-0377-6</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-M0BLH6PC-7/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000325</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000325</idno>
<idno type="wicri:Area/Istex/Curation">000323</idno>
<idno type="wicri:Area/Istex/Checkpoint">000A84</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000A84</idno>
<idno type="wicri:doubleKey">1380-7501:2009:Quenot G:content:based:search</idno>
<idno type="wicri:Area/Main/Merge">003A55</idno>
<idno type="wicri:Area/Main/Curation">003977</idno>
<idno type="wicri:Area/Main/Exploration">003977</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet</title>
<author><name sortKey="Quenot, Georges" sort="Quenot, Georges" uniqKey="Quenot G" first="Georges" last="Quénot">Georges Quénot</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique de Grenoble, BP 53, 38041, Grenoble Cedex 9</wicri:regionArea>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Grenoble</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Tan, Tien Ping" sort="Tan, Tien Ping" uniqKey="Tan T" first="Tien Ping" last="Tan">Tien Ping Tan</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique de Grenoble, BP 53, 38041, Grenoble Cedex 9</wicri:regionArea>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Grenoble</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Le, Viet Bac" sort="Le, Viet Bac" uniqKey="Le V" first="Viet Bac" last="Le">Viet Bac Le</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>LIMSI-CNRS, BP 133, 91403, Orsay Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Orsay</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Ayache, Stephane" sort="Ayache, Stephane" uniqKey="Ayache S" first="Stéphane" last="Ayache">Stéphane Ayache</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique Fondamentale de Marseille, 163 avenue de Luminy - Case 901, 13288, Marseille Cedex 9</wicri:regionArea>
<placeName><region type="region" nuts="2">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Besacier, Laurent" sort="Besacier, Laurent" uniqKey="Besacier L" first="Laurent" last="Besacier">Laurent Besacier</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique de Grenoble, BP 53, 38041, Grenoble Cedex 9</wicri:regionArea>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Grenoble</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Mulhem, Philippe" sort="Mulhem, Philippe" uniqKey="Mulhem P" first="Philippe" last="Mulhem">Philippe Mulhem</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique de Grenoble, BP 53, 38041, Grenoble Cedex 9</wicri:regionArea>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Grenoble</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Multimedia Tools and Applications</title>
<title level="j" type="sub">An International Journal</title>
<title level="j" type="abbrev">Multimed Tools Appl</title>
<idno type="ISSN">1380-7501</idno>
<idno type="eISSN">1573-7721</idno>
<imprint><publisher>Springer US; http://www.springer-ny.com</publisher>
<pubPlace>Boston</pubPlace>
<date type="published" when="2010-05-01">2010-05-01</date>
<biblScope unit="volume">48</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="123">123</biblScope>
<biblScope unit="page" to="140">140</biblScope>
</imprint>
<idno type="ISSN">1380-7501</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">1380-7501</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Audio retrieval</term>
<term>Dynamic programming</term>
<term>International Phonetic Alphabet</term>
<term>Multilingual</term>
<term>Star Challenge</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: We present in this paper an approach based on the use of the International Phonetic Alphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents. The approach works even if the languages of the document are unknown. It has been validated in the context of the “Star Challenge” search engine competition organized by the Agency for Science, Technology and Research (A*STAR) of Singapore. Our approach includes the building of an IPA-based multilingual acoustic model and a dynamic programming based method for searching document segments by “IPA string spotting”. Dynamic programming allows for retrieving the query string in the document string even with a significant transcription error rate at the phone level. The methods that we developed ranked us as first and third on the monolingual (English) search task, as fifth on the multilingual search task and as first on the multimodal (audio and image) search task.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Auvergne-Rhône-Alpes</li>
<li>Provence-Alpes-Côte d'Azur</li>
<li>Rhône-Alpes</li>
<li>Île-de-France</li>
</region>
<settlement><li>Grenoble</li>
<li>Marseille</li>
<li>Orsay</li>
</settlement>
</list>
<tree><country name="France"><region name="Auvergne-Rhône-Alpes"><name sortKey="Quenot, Georges" sort="Quenot, Georges" uniqKey="Quenot G" first="Georges" last="Quénot">Georges Quénot</name>
</region>
<name sortKey="Ayache, Stephane" sort="Ayache, Stephane" uniqKey="Ayache S" first="Stéphane" last="Ayache">Stéphane Ayache</name>
<name sortKey="Besacier, Laurent" sort="Besacier, Laurent" uniqKey="Besacier L" first="Laurent" last="Besacier">Laurent Besacier</name>
<name sortKey="Le, Viet Bac" sort="Le, Viet Bac" uniqKey="Le V" first="Viet Bac" last="Le">Viet Bac Le</name>
<name sortKey="Mulhem, Philippe" sort="Mulhem, Philippe" uniqKey="Mulhem P" first="Philippe" last="Mulhem">Philippe Mulhem</name>
<name sortKey="Quenot, Georges" sort="Quenot, Georges" uniqKey="Quenot G" first="Georges" last="Quénot">Georges Quénot</name>
<name sortKey="Tan, Tien Ping" sort="Tan, Tien Ping" uniqKey="Tan T" first="Tien Ping" last="Tan">Tien Ping Tan</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003977 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003977 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:0EA989FF22B3EF27C120F806C0ADBC5AB14EB9A6 |texte= Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet }}
This area was generated with Dilib version V0.6.33. |